- Political engineering 101: The biz-savvy IT leader’s survival guide
- Huawei set to ship 910C AI chips at scale, signaling shift in global AI supply chain
- The Cyber War on Democracy: Lessons from the 2024 RNC Email Hack
- Los CIO destacan las oportunidades de negociación ahora que AWS y Google reducen los costes de la nube
- 개념? 아키텍처? 기술?··· 데이터 메시 vs. 데이터 패브릭 vs. 데이터 가상화 이해하기
First combined AI-RAN network from Nvidia and SoftBank supports inferencing, claims return of $5 for every $1 invested

Bringing AI as close as possible to enterprise
SoftBank performed an outdoor trial in Japan’s Kanagawa prefecture in which its AI-RAN infrastructure built on Nvidia AI Enterprise achieved carrier-grade 5G performance while using excess capacity to concurrently run AI inference workloads. These workloads included multimodal retrieval-automated generation (RAG) at the edge, robotics control, and autonomous vehicle remote support. SoftBank is calling the trial ‘AITRAS.’
In inferencing, pre-trained AI models interact with previously unseen data for predicting and decision-making. Edge computing moves this closer to data sources to hasten the process.
Garcia pointed out that the concept of edge intelligence has emerged in the last 18 months following the launch of ChatGPT. It pulls together enterprise edge (data centers), operational edge (physical branches), engagement edge (where enterprises interact with consumers) and provider edge (where AI-RAN sits).
This new partnership represents a trend in the market of “bringing AI as close as possible to the enterprise. Enterprises rely on providers for infrastructure for not only running model training, but also inferencing,” Garcia said.
Converting from cost center to revenue-generating asset
Traditional RAN infrastructure is designed using customized chips (application-specific integrated circuits) built solely for running RAN. By contrast, as Nvidia’s Vasishta explained, RAN and AI workloads built on Nvidia infrastructure are software-defined, and can be orchestrated or provisioned according to need.
This can accelerate the 5G software stack compliant with 5G standards to the same level, and in some cases exceeding, the performance/wattage of traditional RAN infrastructure, he said.